Potential of Dynamic Binary Parallelization

Authors

  • Jing Yang
  • Kevin Skadron
  • Mary Lou Soffa
  • Kamin Whitehouse
Abstract

As core counts continue to grow in modern microarchitectures, automatic parallelization technologies are becoming increasingly important for closing the gap between increasingly parallel hardware and software that is still designed for sequential execution. In previous work, we proposed a novel dynamic binary parallelization scheme called T-DBP, which leverages hot traces to provide a large instruction window without introducing spurious control and data dependencies. In this paper, we conduct a limit study to estimate the maximum possible performance of T-DBP on the SPEC CPU2000 benchmark suite. Our results indicate an average speedup over sequential execution of 9.18x for the integer benchmarks and 22.34x for the floating-point benchmarks. We also explain this large speedup by quantitatively demonstrating that T-DBP uses runtime information to overcome two key handicaps of compile-time parallelization techniques. When the effects of these handicaps are artificially emulated in T-DBP, the average speedup shrinks to 4.51x (integer) and 9.36x (floating point).

Similar resources

Trace-Based Dynamic Binary Parallelization

With the number of cores increasing rapidly but the performance per core increasing slowly at best, software must be parallelized in order to improve performance. Manual parallelization is often prohibitively time-consuming and error-prone (especially due to data races and memory-consistency complexities), and some portions of code may simply be too difficult to understand or refactor for paral...

Feasibility of Dynamic Binary Parallelization

This paper proposes DBP, an automatic technique that transparently parallelizes a sequential binary executable while it is running. A prototype implementation in simulation was able to increase sequential execution speeds by up to 1.96x, averaged over three benchmark suites.

Dynamic Parallelization and Vectorization of Binary Executables on Hierarchical Platforms

As performance improvements are being increasingly sought via coarse-grained parallelism, established expectations of continued sequential performance increases are not being met. Current trends in computing point toward platforms seeking performance improvements through various degrees of parallelism, with coarse-grained parallelism features becoming commonplace in even entry-level systems. Ye...

Automated detection of structured coarse-grained parallelism in sequential legacy applications

The efficient execution of sequential legacy applications on modern, parallel computer architectures is one of today’s most pressing problems. Automatic parallelization has been investigated as a potential solution for several decades but its success generally remains restricted to small niches of regular, array-based applications. This thesis investigates two techniques that have the potential t...

Dynamic and Speculative Polyhedral Parallelization of Loop Nests Using Binary Code Patterns

Speculative parallelization is a classic strategy for automatically parallelizing codes that cannot be handled at compile-time due to the use of dynamic data and control structures. Another motivation of being speculative is to adapt the code to the current execution context, by selecting at run-time an efficient parallel schedule. However, since this parallelization scheme requires on-the-fly ...

Publication date: 2012